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Abstract 

In this paper we study saturated fractions of a two-factor design 
under the simple effect model. In particular, we define a criterion to 
check whether a given fraction is saturated or not, and we compute 
the number of saturated fractions. All proofs are constructive and 
can be used as actual methods to build saturated fractions. Moreover, 
we show how the theory of Markov bases for contingency tables can 
be applied to two-factor designs for moving between the designs with 
given margins. 

Keywords: Estimability; Linear model; Markov moves; Complete bi- 
partite graph. 

1 Introduction 

The search for estimable designs is one among the major problems in Design 
of experiments. Given a model, saturated fractions are fractions of a facto- 
rial design with as much points as the number of parameters of the model. 
As a consequence, all information is used to estimate the parameters, leaving 
no degrees of freedom to estimate the error term. Nevertheless, saturated 
fractions are of common use in sciences and engineering, and they become 
particularly useful for highly expensive experiments, or when time limita- 
tions force the choice of the minimum possible number of design points. For 
general references in Design of experiments, the reader can refer to [16] and 
[3], where the issue of saturated fractions is also discussed. 



1 



In this paper we restrict our attention to two-factor designs under the 
simple effects model, and we address the problem of studying the fractions 
with the minimal number of points and characterizing the saturated ones. 
As we discuss in this paper, this question is relevant for the simple effect 
model, as a randomly chosen minimal set of design points yields a singular 
model matrix with very high probability when the number of levels of the 
factors becomes large. 

Our approach is based on two main ingredients. First, we apply tools 
from Linear Algebra and Combinatorics to characterize the saturated frac- 
tions. Some notions, and in particular the definition of A:-cycle that we will 
present later, has already been considered in the framework of contingency 
tables in |14] for the definition of robust procedures for outliers detection 
in contingency tables. Second, we identify a factorial design with a contin- 
gency table whose entries are the indicator function of the fraction, i.e., they 
are equal to 1 for the fraction points and otherwise. This implies that a 
fraction can also be considered as a subset of cells of the table. Most of the 
techniques we present within the latter framework will be based on Algebraic 
statistics. The application of polynomial algebra to Design of experiments 
has already been presented in [15], but with a different point of view. The 
techniques used here are mainly based on the notion of Markov bases and 
were originally developed for contingency tables, both to solve enumeration 
problems and to make non-asymptotic inference, and to describe the geo- 
metric structure of the statistical models for discrete data. A recent account 
of this theory can be found in [7] . 

In this paper, we benefit from the interplay between algebraic techniques 
for the analysis of contingency tables and some topics of Design of experi- 
ments. From this point of view, the identification of a factorial design with 
a binary contingency table is an essential step. Some recent results in this 
direction can be found in pp. The connections between experimental de- 
signs and contingency tables have also been explored in [9], but limited to 
the investigation of enumerative problems in the special cases of contingency 
tables from the Sudoku problems. Here, we use a special class of Markov 
bases, named universal Markov bases, introduced in [17], and we show that 
the structure of these Markov bases is strictly related with the /c-cycles. 

The paper is organized as follows. In Section[2]we set some notations, we 
state of the problem, we define the A;-cycles in a fraction and we characterize 
them in terms of orthogonal arrays. In Section [3] we prove the main result, 
showing that the absence of a fc-cycle is a necessary and sufficient condition 
for obtaining a saturated fraction, while in Section @] we enumerate the 
saturated fractions showing that their proportion over the whole number 
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of fractions tends to zero as the number of levels increases. Section [5] is 
devoted to the computation of the relevant Markov basis for this problem 
and to showing its connection with fc-cycles. Finally, in Section [6] we suggest 
some future research directions stimulated by the theory presented here. 

2 Saturated designs and cycles 
2.1 Notations and basic definitions 

Let T> be a full factorial design with 2 factors, A and B, with I and J levels, 
respectively (I, J > 2), V = [I] x [J] = {1, ...,/} x {1, ... , J}. We consider 
a linear model on T>: 

Y i:j = mj + e itj for i G [I], j G [J] , 

where Yi j are random variables with means fiij and £tj are centered random 
variables that represent the error terms. In this paper we always consider 
the simple effect model, where the means /ijj of the response variable are 
written as: 

Hi j = n + ai + f3j for i G e [J] , (1) 

where [i is the mean parameter, and ctj and (3j are the main effects of A and 
B, respectively. 

We denote by p the number of estimable parameters. Therefore, p = 
I + J — 1 in the model of Equation ([T]). Under a suitable parametrization, 
the matrix of this model is a full-rank matrix with dimensions IJx (7+ J— 1). 
In this paper we will use the following model matrix: 

X = (m | ai | ... | a/„i \ h \ ... \ , (2) 

where tuq is a column vector of l's, a±, . . . ,a/_i are the indicator vectors 
of the first (/ — 1) levels of the factor A, and b\, . . . , are the indicator 
vectors of the first (J — 1) levels of the factor B. It is known that this matrix 
corresponds to the following reparametrized model: 

fjbij = (1 + c\i + f3j, for i G [I] , j G [J] ; 

/i = ^ + a/ + /3j, ai = a i -a I , = — fij . 

A subset J-, or fraction, of a full design T>, with minimal cardinality 
j^T = p, that allows us to estimate the model parameters, is a main-effect 
saturated design. By definition, the model matrix Xjr of a saturated design 
is non-singular. 



3 



X = 





1 




/1 .1 

'I z 


I), 

" 1 


'-'2 


















(1,1) 


n 


i 





1 





















(1,2) 


i 


i 








1 



















(1,3) 


i 


i 











1 




1 


"1 


0,2 


6i 


b 2 


b. 


(1,4) 


i 


i 














(1,1) 


/ 1 


1 





1 





°\ 


(2,1) 


i 





1 


1 








(1,2) 


1 


1 








1 





(2,2) 
(2,3) 


i 
i 






1 
1 






1 






1 


x (2 ' 2) 
(2,3) 


1 
1 






1 
1 






1 






1 


(2,4) 


i 





1 











(3,3) 


1 














1 


(3,1) 


i 








1 








(3,4) 
















0/ 


(3,2) 


i 











1 



















(3,3) 


i 














1 
















(3,4) 
















o) 

















Figure 1: The model matrix X of the full factorial 3x4 design and the 
model matrix Xjr of the fraction in Example 12.11 

Example 2.1. Let us consider the case 1 = 3, J = 4 and the fraction 

T = {(1,1), (1,2), (2,2), (2,3), (3,3), (3,4)} . 

The model matrix X of the full design and the model matrix Xjr of the 
fraction are given in Figured Is this case, det(Xjr) = 1. 

2.2 A;-cycles and orthogonal arrays 

As mentioned in the Introduction, in general, the problem of selecting satu- 
rated designs is non trivial and our case does not make exception. The key 
ingredient to characterize a saturated design for two-factor designs is the 
notion of cycle. Here we give a definition in term of Design of experiments. 

Definition 2.1. A k-cycle (k > 2) is a subset with cardinality 2k of a 
factorial design I x J with I, J > k where each of the k selected levels 
(among the I 's and J's) of each factor has exactly two replications. 

Example 2.2. Some examples of fractions with k-cycles are given in Figure 
As described above, we identify the design points with the cells of a 
contingency table in order to simplify the presentation. The corresponding 
fractions are: 

Fx = {(1,1), (1,3), (2,1), (2,2), (3,2), (3,3), (4,3)} , 

F 2 = {(1, 1), (1, 3), (2, 2), (2, 4), (3, 2), (3, 3), (4, 1), (4, 4)} , 

T 3 = {(1, 1), (1,3), (2,1), (2,3), (3,2), (3,4), (4,2), (4,4)} . 
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Figure 2: A 3-cycle (left) and two 4-cycles (middle and right). 



Notice that F\ contains a 3-cycle, Ti and F$ contain a 4-cycle. In F3 the 
4-cycle can be decomposed into two sub-cycles. 

In a natural way, the coordinate points of the design T> can be considered 
as the rows of a jf-T) x d matrix, where d is the number of factors. With a 
slight abuse of notation, we still call this matrix design . The same holds for 
fractions. In order to analyze the role of the ^-cycles within the framework of 
Design of experiments we recall here a combinatorial definition of orthogonal 
array, see p3] and [8]. 

Definition 2.2. A fraction J 7 of a design I\ X ■ ■ ■ X Id with j^F = n is 
an orthogonal array of size n and strength t if, for all t-tuples of its fac- 
tors Fi ± , . . . , Fi t , all possible combinations of levels in [IjJ x • • • x [Jj t ] appear 
equally often. We denote such an orthogonal array with OA(n; (ii, . . . , Id);t). 

Proposition 2.1. A k-cycle (k>2) is: 

1. an OA{2k; (k, k);t) where t = 2 if k = 2, and t = 1 if k > 3; 

2. the union of two disjoint orthogonal arrays OA{k; (k, k); 1). 

Proof. 1. This fact follows from Definition 12.11 In particular, for k = 2 
T coincides with the full factorial design 2 2 . 

2. We construct two disjoint fractions OA\ and OA2, OA\ U OA2 = 
J-, iteratively. Starting from a given point of the fraction, we assign 
alternatively to OA\ and OA2 the points of the fraction, choosing the 
first or the second factor. 

• Choose a point of F, say (ii, ji), and assign it to OA\. 

• Consider the unique point of F with the same level for the first 
factor, (ii, J2), with j'2 7^ ji, and assign it to OA2. 

• Consider the unique point of F with the same level for the second 
factor, (i2, j'2), with 12 H, and assign it to OA\. 



• Consider the unique point of T with the same level for the first 
factor, (12,33) with j'3 7^ j 2 ^ j\ and assign it to OA 2 . 

• And so on, until the unique point to choose is already assigned. 

If not all points of the fraction have been assigned, i.e. if the fraction 
contains sub-cycles, it is enough to start the procedure above on the 
remaining points, until all the points are assigned. In this way both 
0A\ and OA 2 have, by construction, exactly one replicate for each of 
the k levels of the two factors. 

□ 

Example 2.3. We show how the decomposition of a fraction into two or- 
thogonal arrays works on a A-cycle. Let I = J = 4, and consider the A-cycle 

T 2 = {(1, 1), (1, 3), (2, 2), (2, 4), (3, 2), (3, 3), (4, 1), (4, 4)} 

already considered in Example \2.3\ and displayed in Figure The relevant 
orthogonal arrays are: 

OA 1 = {(1,1), (2,2), (3,3), (4,4)} , 
0A 2 = {(1,3), (2,4), (3,2), (4,1)} . 

To determine the number of /c-cycles we need the notion of derangement. 
A derangement is a permutation such that no element appears in its original 
position. 

Proposition 2.2. The number of k- cycles is 

kUk 

where Ik denotes the number of derangements of k elements. 

Proof. Let us consider OA\ and 0A 2 as in Proposition 12.11 0A\ represents 
a permutation 7Ti of \k\. The fraction OA 2 represents a derangement of 

7Ti([fc]). □ 

To actually compute \k, recall that \k can be approximated by \k\je + 
0.5J, where [ - J is the floor function. For more details on this theory, see for 
instance [12] . 
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3 fc-cycles and saturated fractions 



As mentioned in the previous sections, the connections between saturated 
designs and cycles have been explored in a slightly different framework, 
in the study of robust estimators in contingency tables analysis, see [14] . 
Nevertheless, we restate here the relevant theorem within the language of 
Design of experiments and we give the proof, as its main algorithm will be 
used later in the paper. 

Theorem 3.1. A fraction T with p = I + J — 1 points yields a saturated 
model matrix if and only if it does not contain cycles. 

Proof. In view of Proposition 12. 1] a cycle can be decomposed into two dis- 
joint orthogonal arrays, OA\ and OA2, of k points each. 

=> Suppose that J- contains a cycle. When we sum the rows of the model 
matrix Xj with coefficient +1 for the points in the OA\ and with coef- 
ficient — 1 for the points in OA2, we produce a null linear combination 
and therefore the determinant of Xj- is zero. 

Suppose that Xjr is singular, i.e., there exists a null linear combination of 
its rows with coefficients that are not all zero. Denote by ru^ the row 
of the model matrix corresponding to the point of the fraction. 

Therefore, we have 

7i?"(i 1 j 1 ) + ---+7pr(Wp) =0 ( 3 ) 

and the coefficients 71, • • • , 7 P are non all zero. Without loss of gener- 
ality, suppose that 71 > 0. As the indicator vector of the ii-th level of 
the first factor is in the column span of Xj, and the same holds for 
the ji-th level of the second factor, we must have: (a) a point with 
the same level for the first factor, say (^1,^2), with negative coefficient 
in Equation ([3|); (6) a point with the same level for the second factor, 
say (23, Ji), with negative coefficient in Equation ([3]). Therefore, there 
must be a point with level i^ for the first factor and a point with level 
j2 for the second factor with positive coefficients. Now, two cases can 
happen: If the point (13, j%) is a chosen point and its coefficient in 
Equation ([3]) is positive, we have a 2-cycle; otherwise, we iterate the 
same argument, and we yield a fc-cycle, with k > 2. 

□ 
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4 The number of saturated designs 

In this section we study the structure of the saturated fractions described 
in Section [2 

Definition 4.1. Given a fraction T , we define its margins niA = (jnA,i, ■ ■ ■ ■> mA,i) 
and me = (?tjb,i, ■ ■ ■ , m^j) where: 

mA,i = ^2 ( dl = f° r i G W ' 

(di,rf 2 )GJ' 
(di,d 2 )eJ 7 

where (•) denotes the indicator function. 

Notice that j is the number of the occurrence in T of the i-th level 
of the factor A. For example, the following saturated design 

F = {(1, 1), (1, 2), (2, 1), (2,4), (3,2), (3, 3), (4,4)} . 

has margins tua = (2, 2, 2, 1) and = (2, 2, 1, 2). 

The following lemmas for I x I designs will be used later in the proof of 
the main result of this section. 

Lemma 4.1. Let Ti be a saturated I x I design. Its margins satisfy the 
following conditions: 

1. mA,+ = mB,+ = 21 — 1 where + denotes the summation over the 
corresponding index; 

2. rriA,i > 1, m B ,j > 1 for all i,j G [I]; 

3. there exist i,j G [I] such that mA,i = m B,j = 1/ 

4- let i+ G [I] be an index such that mAi* = 1- Let (?*,_?*) be the only 
point (di,d,2) of Ti such that d\ = i+. Then msj^ > 1- 

Proof. 1. It is immediate to see that mA,+ = Yll=i d 2 )£J r ( dl = *) = 
#•7-/ = 27 — 1 and the same holds for i?. 

2. Refer to the matrix representation in Equation [2j By absurd, suppose 
that there exists an index i such that m^i = 0. We distinguish two 
cases: if i < / — 1 then has a null column, corresponding to a^; 
if i = J, the sum of the columns ai, . . . , ar_i is equal to mo- In both 
cases, Xj^j- is singular. The same applies to B. 
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3. This point follow immediately from items Q] and [2j 

4. For sake of readability, we suppose that i* = I and j+ = I. Thus, the 
sum of a%, . . . , ai-\ is equal to the sum of b\, ... , bi-\. Hence, Xjr I is 
singular. 

□ 

Remark 4.1. The four conditions in Lemma \4.1\ are not sufficient for char- 
acterizing the saturated fractions. A simple counterexample is the following 
fraction of a 5 x 5 design: 

{(1, 1), (1, 2), (2, 1), (2, 3), (3, 2), (3, 3), (4, 4), (4, 5), (5, 4)} , 

that is not saturated, as it contains a 3-cycle. 

In the following result we analyze how the saturation property is pre- 
served when we add one level to each of the two factors, moving from an 
I x I design to an (/ + 1) x (7 + 1) design. 

Lemma 4.2. Let Ti be a saturated Ixl design, and define an (J+l)x(/+l) 
design containing Ti as: 

= Ti U E i+1 with J> n E I+1 = . 

Then J-j+i is saturated if and only if Ei + \ has exactly two points, chosen 
in the union of the (I + l)-th row with the (I + l)-th column of J- with 
the conditions m,A,i+i > 1 and msj+i > 1- 

Proof. Ej+i must contain exactly two design points, as i^Ti = 21 — 1 and 
ftj-i+i = 21+1. If one or two points are not in the union of the (/ + l)-th 
row with the (J+ l)-th column of J~i+i, there is a contradiction with Lemma 
14.11 If both points are chosen in the (J+ l)-th row and in the first / columns, 
the margin itib,i+i = 0, a contradiction. With an analogous proof, the two 
points can not be chosen in the (/ + l)-th column and in the first / rows. 
All the remaining cases are valid choices, as one among uiaj+i and msj+i 
is equal to 1 and therefore no cycles can appear. □ 

We are now ready to approach the problem of computing the number of 
saturated fractions. 

Proposition 4.1. Given mA = (^4,1, ■ ■ ■ , thaj) andms = (mB,i, . . . ,rriB,j) 
with tua,+ = mB,+ = I + J — 1, mA,i > l,i € [I] and m#j > l,j € [J], the 
number of saturated designs with margins equal to mA and tub is 

( '- 1 V J_1 

\ m B ,i m B ,j - 1 J \ m A ,i - 1, - - - , m A ,i - 1 




9 



Proof. First we consider J = I. Without loss of generality, we can assume 
that the margins of the fraction are arranged in the form: 



Since rriAi = 1, we can choose a point for the last row, but we have to 
exclude all design points (I, h) with m,B,h = 1, in order to satisfy the condi- 
tion in Lemma I4.lt item HI In the same way, we choose a point in the last 
column. 

We repeat the same argument on the (I — 1) x (I — 1) design obtained 
by deletion of the last row and of the last column. It is immediate to see 
that both margins of such design have a component equal to 1. Hence, we 
iterate (/ — 2) times the procedure above, until we have a degenerate lxl 
design with 1 as its unique element. 

If we analyze this algorithm backward, we note that at each step we add 
two points according to the rule in Lemma [4. 2 1 and therefore the constructed 
fraction is saturated. Thus, the procedure generates 



as each row can be chosen until the margin decreases to 1, and the same 
holds for columns. 

Now, consider J > I. The .B-margin can be arranged in the form: 

■m B ,\ > m B ,2 > ■ > m B j = 1 = m B ,i+i = 1 = • • • m B)J = 1 . 

With this ordering of the margins, the square I x I table on the left can be 
analyzed as in the previous case, while for the last J — I columns there is 
only the constraint given by the ^-margin, because the S-margin is always 
equal to 1, and therefore no fc-cycles can appear. Hence, the formula in (j4]) 
is proved. □ 

Theorem 4.1. The number of saturated I x J designs is P J_1 ' J^~ l \ 

Proof. It follows from Proposition 14. 1 1 and the classical multinomial theorem, 
by summation of all possible terms (see Equation UD corresponding to all 
possible margins. □ 

We notice that the key consequence of the above enumerations is that 
the proportion of singular design matrices is not negligible, and it becomes 
as large as / and J increase. 



m A ,i > m A ,2 >■■■> m A ,i = 1 , 
mB,i > m B)2 > • • • > m B j = 1 . 
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Corollary 4.1. Let us randomly choose Jc Ix J with f^T = I + J — 1. 
XTie probability that J 7 is a saturated I x J design is 



and if tends to as I and J goes to infinity. 

For instance, let us consider I = J. For I = 3 we obtain a saturated 
design in 64% of cases, for I = 4 in 36% of cases, for / = 5 in 19% of cases, 
while for I = 6 in 10% of cases. Hence, the characterization of non-singular 
designs, as given in Theorem[37TJ is useful from an algorithmic point of view, 
because the random choice of a subset with I + J — 1 points does is not an 
efficient procedure. 

Example 4.1. We discuss here the case I = J = 4 extensively. The number 
of saturated designs is 4 6 = 4096, corresponding to 36% of designs with 7 
points. The possible configurations of margins, up to the permutation of the 
levels, are: (4,1,1,1), (3,2,1,1) and (2,2,2,1). The table below shows the 
number of saturated design with such margins for one factor, see Proposition 
\4-l\ the number of multiset permutations of such configurations of margins 
and the product of them. 



Margin 


# from Prop. \4-l\ 


Multiset permutations 


Total 


(4,1,1,1) 


(3,0,0,0) 


(i 4 s) 


4 


(3,2,1,1) 


(2,1,0,0) 




36 


(2,2,2,1) 


(1,1,1,0) 


( 3 4 i) 


24 



The table below shows the number of saturated design with respect to the two 
margins. 



Margins 


(4,1,1,1) 


(3,2,1,1) 


(2,2,2,1) 


(4,1,1,1) 


16 


144 


96 


(3,2,1,1) 


144 


1296 


864 


(2,2,2,1) 


96 


864 


576 



Finally, Figure \4~l\ shows the saturated Ax A designs identified by contingency 
tables. For each margin configuration, a representative of each equivalence 
class of tables is displayed. The equivalence is up to permutation of rows, 
permutation of columns, and transposition. We notice that in two cases 
there is more than one equivalence class. 
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Figure 3: Non-equivalent saturated 4x4 designs classified by the margins. 

5 Markov bases for I x J designs 

Proposition 12.11 and Theorem 13. II lead to an interesting connection with the 
theory of Markov bases for this kind of experimental design. In fact, we have 
already identified the two- factor design with a binary Ix J contingency table. 
Under such representation, we define the matrix N^F), where N(J r )i t j = 1 if 
is a point of the fraction and N(J r )ij = otherwise. Using Algebraic 
statistics tools, we are able to generate all fractions with given margins 
through a Markov chain algorithm following the theory in |17j . 

We briefly recall the basic facts about Markov bases. The reader can refer 
to the book [7] for a complete presentation. A Markov move m is a table 
with integer entries such that N^J 7 ), N{J r )+m and N{J : ) — m have the same 
margins. A Markov basis Ad is a finite set of Markov moves which makes 
connected the set of all tables, or designs, with the same margins. Notice 
that by adding Markov moves to a Markov basis yields again a Markov basis. 

If we start from a fraction with matrix N^q), the Markov chain is then 
built as follows: 

• at each step i, randomly choose a Markov move m in KA and a sign 
£G {±1}; 

• if N(J-i) + em is a binary table, move the chain to A^J^+i) = N(J-i) + 
em; otherwise, stay in N(J-i). 
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Figure 4: The complete bipartite graph for a 3 x 4 design. 



The Markov chain described above is a connected chain over all the designs 
with fixed margins, and its stationary distribution is the uniform one. By 
considering the classical Metropolis-Hastings probability ratio, one can de- 
fine a Markov chain converging to any specified probability distribution, see 
®. 

The theory in [T7], Sections 4 and 5, states that the relevant Markov 
basis for our problem can be computed from the complete bipartite graph 
of the design. The complete bipartite graph has one vertex for each level of 
A, one vertex for each level of B and one edge connects each A- vertex with 
each i?-vertex. A circuit of degree A: is a closed path with 2k vertices, and 
with edges 

(k,h), Uuk), O2, J2), • • • , (ik,3k), (jk,h) , (5) 

where . . ,ik are distinct ^-indices and ji, . . . , jk are distinct B-indices. 
A concise presentation of the theory of bipartite graphs can be found in |20j . 
The complete bipartite graph for the 3x4 designs is depicted in Figure [5) 
Then, the Markov basis for our problem is defined by associating a 
Markov move to each circuit of the complete bipartite graph. Such move 
has entry 1 for each edge in even position in the sequence © , and has entry 
— 1 for each edge in odd position. 

Proposition 5.1. The k-cycles, decomposed into two orthogonal arrays as 
in Proposition \2. 11 form a Markov basis. 

Proof. It is enough to observe that each circuit of degree k naturally defines 
a A;-cycle decomposed as in Proposition 12.11 □ 
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Example 5.1. We now discuss the 3x4 case extensively. The complete 
bipartite graph in Figure^has 12 circuits with 4 edges and 12 circuits with 
6 edges. Thus, the Markov basis is formed by: 

• 18 moves of the form 

m = I -1 1 I (6) 




corresponding to the 2- cycles; 
• 24 moves of the form 

m= ^-1 1 ) (7) 

corresponding to the 3-cycles. 

The moves in Equations ([6]) and ([7]) are derived from the circuits (1, 1), (1, 2), (2, 2), (2, 1) 
and (1, 1), (1, 2), (2, 3), (3, 3), (3, 2), (2, 1), respectively. 
For instance, starting from the fraction with table 

N(7) = 

we see that no move produces a valid binary table. Thus, we conclude that 
iV^J 7 ) itself is the unique fraction with such margins. On the other hand, 
starting from the fraction with table 




N(T) = 

with the Markov moves, we can reach the other two tables 






Notice that a Markov basis preserve the margins, but it generates all the 
fractions, no matter if they are saturated or not. For example, the fraction 



N{7) 
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represents a saturated design, while 



/O 
N(T) + 1 -10 
\0 -1 1 



does not. 



The number of fc-cycles increases fast with / and J, and the computa- 
tions become unfeasible for large / or J. However, with some additional 
hypotheses on the fractions, for example assuming strict positivity of all 
margins, the connectedness of the Markov chain is ensured by a subset of 
circuits. That computational issues are studied in [11] and [18j . 

The results above yield another method for enumerating the saturated 
fractions. In fact, from Theorem 13.11 a saturated fraction must have size 
equal to / + J — 1 and it must not contain cycles. For those readers familiar 
with the language of graph theory, this implies that, in terms of the com- 
plete bipartite graph, the subgraph corresponding to the fraction must be a 
spanning tree. A known result in graph theory, named as the Cayley's for- 
mula, says that the number of spanning trees in a complete bipartite graph 
is I J ~ l J I ~ 1 . This represents an alternative proof of theorem 14. 1[ For details 
on complete bipartite graphs and the Cayley's formula see [19j . 

6 Conclusions 

The theory described in this paper suggests several extensions and appli- 
cations. First, it is interesting to explore how the results can be extended 
for the characterization of saturated fractions to more general designs. In 
the multi-way setting the generalization of /c-cycles is not trivial and more 
advanced notions of Algebraic statistics must be used. Some preliminary 
examples are being explored by the authors and the findings seem fruitful. 
Second, the connections between fractions and graphs, as suggested in the 
last paragraph of Section [5j can lead to new useful results. Furthermore, 
it would be interesting to study the classification of the saturated fractions 
with respect to some statistical criteria. Among these criteria, we cite the 
minimum aberration in a classical sense, or more recent tools, such as state 
polytopes. Minimum aberration is a classical notion in this framework, and 
is supported by a large amount of literature. This theory has been devel- 
oped in [10] and more recently in, e.g., [21] with the use of the indicator 
function in the two-level case. The extension to the multilevel case is cur- 
rently an open problem. The use of state polytopes has been introduced 
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in [5]. From the point of view of applications, the use of the inequivalent 
saturated fractions to perform exact tests on model parameters is worth 
studying, together with its implementation in statistical softwares, such as 
SAS or R. For inequivalent orthogonal arrays very interesting results have 
already been achieved, see [I] and [2]. 
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